INFORMATION RETRIEVAL WITH NON-NEGATIVE 
MATRIX FACTORIZATION 


ABSTRACT OF THE INVENTION 

Disclosed is a method of indexing a database of documents, comprising providing 
a vocabulary of n terms, indexing the database in the form of a non-negative nxm index 
matrix V, wherein each of its m columns represents an j*^ document having n entries 
containing a function of the number of occurrences of a i^^ term of said vocabulary 
appearing in said document, factoring out non-negative matrix factors T and D such 
that TD, and wherein T is an « x r term matrix, Dismrxm document matrix, and r 
< nml{n-^m). The index so generated is useful in two-pass information retrieval systems. 
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